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Consider nondeterministic finite automata recognizing base-fc positional notation of numbers. As- 
sume that numbers are read starting from their least significant digits. It is proved that if two 
sets of numbers S and T are represented by nondeterministic automata of rn and n states, respec- 
tively, then their sum {s + 1 | s S S, t € T} is represented by a nondeterministic automaton with 
2mn+2m+2n + 1 states. Moreover, this number of states is necessary in the worst case for all k ^9. 

1 Introduction 

Descriptional complexity of operations on regular languages with respect to their representation by finite 
automata and regular expressions is among the common topics of automata theory. With respect to 
deterministic finite automata (DFAs), and using the number of states as a complexity measure, the state 
complexity of basic operations on languages was determined by Maslov lITO in 1970. In particular, 
such results as "if languages K and L are recognized by DFAs of m and n states, respectively, then the 
language KL requires a DFA with up to (2m — l)2 n ~ l states" originate from that paper. 

Over the last two decades, similar results were obtained for nondeterministic finite automata (NFAs). 
In particular, Birget [2] has shown that the complement of a language recognized by an n-state NFA 
may require an NFA with as many as 2 n states, and this result was later improved by Jiraskova [9] who 
reduced the alphabet for the witness language from {a,b, c, d} to {a, b}. The systematic study of non- 
deterministic state complexity, that is, state complexity with respect to NFAs, of different operations 
was started by Holzer and Kutrib (H, who obtained, in particular, the precise results for union, inter- 
section and concatenation. More recently Jiraskova and Okhotin ifTOl determined the nondeterministic 
state complexity of cyclic shift, Gruber and Holzer [4] established precise results for scattered substtings 
and scattered superstrings, Domaratzki and Okhotin |3] studied A;-th power of a language, L k , while 
Han, K. Salomaa and Wood [5 ] considered the standard operations on NFAs in the context of prefix-free 
languages. 

The present paper continues this study by investigating another operation, which has recently been 
used by Jez and Okhotin (7J HI in the study of language equations. This is the operation of addition of 
strings in base-A; positional notation. Let = {0, 1, . . . , k — 1} with k ^ 2 be an alphabet of digits. Then 
a string a^_i ■ ■ ■ ao € L* k represents a number (ci£_i • • • ao) k = Li=d a i ' an( ^ there is a correspondence 
between natural numbers and strings in Z£ \ 0Z£- F° r two strings u, v e \ 0£^, their sum can be defined 
as w = u EE v as the unique string w € ~L* k \ 0Z£, for which (w) k = (u) k + (v) k . The operation extends 
to languages as follows: for all K,LCI* k \ 0££, K EE L = {u EE v | u G K, v G L}. 
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This operation preserves regularity, and proving that can be regarded as an exercise in automata 
theory. The paper begins with a solution to this exercise, given in Section 12 For convenience, it is 
assumed that automata read a notation of a number starting from its least significant digit; to put it 
formally, a slightly different operation is studied: KS R L = (K R E\ L R ) R . This variant seems to be 
more natural in the context of automata, and furthermore, since the nondeterministic state complexity of 
reversal is n + 1, the complexity of these two operations is almost the same. 

The straightforward construction of an automaton recognizing the language L(A) 5i R L(B) for an 
m-state NFA A and an n-state NFA B yields an NFA with 2mn + 2m + 2n + 1 states. The purpose of 
this paper is to show that this construction is in fact optimal, and there are witness languages, for which 
exactly this number of states is required. This is established in Section |3l where worst-case automata 
are presented for m, n ^ 1 with m + n ^ 3. The case of m = n = 1 requires a special treatment, and it 
is proved that the NFA recognizing a positional sum of two one-state automata requires 6 states in the 
worst case. 



A nondeterministic finite automaton (NFA) is a quintuple A = (Q,L,5,qo,F), in which Q is a finite set 
of states, E is a finite input alphabet, 5 : Q x E — > 2^ is the (nondeterministic) transition function, go £ Q 
is the initial state, and F C Q is the set of accepting states. An NFA is called a deterministic finite 
automaton (DFA) if |5(g,a)| = 1 for all q and a, and it is a partial DFA if |J(g,o)| ^ 1. The transition 
function can be naturally extended to the domain Q x E*. The language recognized by the NFA A, 
denoted L(A), is the set {wel* S(q ,w) 

Throughout this paper, the letters in an alphabet of size k are always considered as digits in base-A; 
notation, and the alphabet is E& = {0, 1, . . . , k — 1}. With such an alphabet fixed, the nondeterministic 
state complexity of positional addition of NFAs is defined as a function : N x N — > N, where fk{m, n) 
is the least number of states in an NFA sufficient to represent L(A) 5i R L(B) for every m-state NFA A 
and n-state NFA B with L(A),L(B) C Ej£\0E£. The following lemma, besides formally establishing 
that regular languages are closed under addition in positional notation, gives an upper bound on this 
function. 

Lemma 1. Let A and B be NFAs over E^ = {0, l,...,k— 1} with m and n states, respectively. Let 
L(A) n 0E£ = L(B) n 0E£ = 0. Then there exists a (2mn + 2m + 2n + \)-state NFA over E fc for the 
language L(A) S R L(B). 

Proof: Let A = (P,Lk,SA,Po,FA) an d B = {Q,^k,SB,qo,FB). The new NFA C has a set of states split 
into four groups: Q = Q AB UQ A UQ B U {q acc }, where 



(I) Each state (p,q,c) € Q corresponds to A in state p, B in state q and carry digit c € {0, 1}. In 
particular, the state (po><?o>0) is the initial state of this NFA. State (p,q,c) represents the case shown in 
Figure QJleft). A string of digits ddddd has been read, and C has guessed its representation as a sum of 
two strings of digits, aaaaaS R bbbbb, where A goes to p by aaaaa and B goes to q by bbbbb. If c = 1, 
then aaaaaS R bbbbb = Iddddd. 



2 Constructing an NFA for Km K L 



Q 



,AB 



PxQx{0,l}, 

{^}xPx{0,l}, 

{B}xQx{0,l}. 
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The transitions from one state of this kind to another are defined as follows. Suppose A reads a digit a 
and goes from p to p', while B may go from q to q' by a digit b. Then, taking the carry digit c into account, 
the sum may contain a digit a + b + c or a + b + c — kin this position depending on whether a+b+c < k 
or not, and also the carry should be adjusted accordingly. Thus C has a transition from (p, q, c) to to 
(p', q', 0) by a + b + c if a + b + c < k, or a transition to (p', </, 1) by a + 6 + c — /c if a+b+c^ k. This 
procedure continues until the string of digits recognized by A or by B finishes. Then C enters a state of 
one of the following two groups. 



■)AB 



QA QB 



A(p) ° A(p) ° 

-< — ^ — a a a a a -< — ^ — a a a a a a a a 

< B(g) b b b b b b b b < B(g) b b b b b 



d d d d d d d d d d d d d d d 

Figure 1: Transitions out of (p,q,0) & Q AB in the constructed NFA. 



(II) If the automaton B is no longer running (that is, the notation of the second number has ended), 
while A still produces some digits, this case is implemented in states (A,p,c) G Q A , where p is a state 
of A and c is a carry. This case is illustrated in Figure QJmiddle). The NFA C reaches this group of states 
as follows. For every state (p,q,c) G Q AB , such that q is an accepting state of B, the string recognized 
by B can be pronounced finished. Suppose that A may go from p to p' by a digit a. Then the sum may 
contain a digit a + cora + c — k. This case is represented by a transition of C from (p, q, c) to (A,j/, 0) 
by a + c if a + c < k, or to (A,p ! , 1) by a + c — k if a + c^ k. Once C enters the subset Q A , it can 
continue reading the number as follows. For every state (A,p,c), if A may go from p to p' by a digit a, 
then there is a transition from (.A,p,c) to (A,p',0) by (o + c) if a + c < k, or to (A,p\ 1) by (a + c — fc) 
if a + c ^ fc. 

(III) Symmetrically, there is a group of states (B,q, c), which correspond to the case when the number 
read by A has ended. For each state (p, q, c) G Q AB with p G F^, for every digit b and for every state q', 
such that i? has a transition from g to q' by 6, the new automaton C has a transition from (p,q,c) to 
(B,q',0) by 6 + c if 6 + c < fc, or to (B,q',l) by 6 + c — if b + c ^ fc. Second, for every state (B,q,c), 
if B may go from q to g' by a digit b, then C has a transition from (B,q,c) to (B,q',0) by (b + c) if 
6 + c < fe, or to (£,<?', 1) by (b + c- k) if b + c^ k. 

(IV) (? acc is a special accepting state with no outgoing transitions. This state is needed when the 
strings of digits recognized by A and B have already finished, but the carry digit remains, and thus an 
extra input symbol has to be read. The automaton C reaches this state by reading the digit 1 under the 
following conditions: for all p G Fa and q G Fb, there are transitions by 1 from (p, q, 1), from (A,pA) 
and from (B,q,l) to q acc . 

The other accepting states are all states of the form (p,q,0), (A,p,0) and (B,q,0), with p G Fa 
and q G Fb- 

This completes the construction. The general form of transitions from a state (p,q,c) G Q AB is 
illustrated in Figure |2j separately for c = and c = 1 . □ 
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Figure 2: Transitions out of (p,q,0) and out of (p,q, 1) in the constructed NFA. 

3 Lower bounds 

The goal of the paper is to prove that the 2mn + 2m + 2n + 1 bound of Lemma[T]is tight. As this requires 
a rather difficult proof, the following weaker result will be established first. 

Lemma 2. Let = {0,1,..., A; — 1} be an alphabet with k ^ 2. Let m,n ^ 1 be relatively prime 
numbers and consider languages L m = (i m )* and L n = (l n )*, which are representable by NFAs of m 
and n states, respectively. Then every NFA recognizing the language L m E\ R L n has at least ran states. 

Proof: Let A be an NFA for L m 5i R L n with I states. If k ^ 3, construct a new Estate NFA B recognizing 
(L m E\ R L n ) n 2* which can be done by taking the NFA A and omitting transitions by all symbols except 
for 2. Then L{B) = (2 mn )* . This is a language that requires an NFA of at least mn states. Therefore, 
I ^ mn. In the case of k = 2, let B recognize (L m S R L n ) flOl*. In this case it is sufficient to have 
£+1 states in B, and L(B) = 0(l mn )*. As this language requires an NFA with at least mn+ 1 states, 
the statement is proved. □ 

In order to prove a precise lower bound, a different construction of witness languages is needed. At 
present, the witness languages are defined over an alphabet of at least nine symbols, that is, the bound 
applies to addition in base 9 or greater. Lower bounds on the resulting languages of sums will be proved 
using the well-known fooling-set lower bound technique. After defining a fooling set we recall the lemma 
describing the technique, and give a small example. Then, the lower bound result follows. 

Definition 3. A set of pairs of strings {(xi,yi) \ i= 1,2, . .. ,n} is said to be a. fooling set for a language L 
if for every i and j in {1,2, . . . ,n}, 

(Fl) the string x-iUi is in the language L, 

(F2) if i ^ j, then at least one of the strings XiUj and XjHi is not in L. 

Lemma 4 (Birget [1|). Let A be a fooling set for a regular language L. Then every NFA recognizing 
the language L requires at least \A\ states. 
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Example 5. Consider the regular language L = {w € L* | the number of a's in w is a multiple of n}. 
The set of pairs of strings {(a,a n_1 ), (a 2 ,a n_2 ), . . . , (a n ,e)} is a fooling set for the language L because 
for every % and j in {1,2, . . . ,n}, 

(Fl) a l a n ~ % = a n , and the string a n is in the language L, and 

(F2) if 1 < i < j < n, then a*a n ~ J = a n ~^~ l \ and the string a n ~V~ 1 ' is not in the language L since 

< n — (j — i) < n. 

Hence by LemmaH] every NFA for the language L needs at least n states. o 

Lemma 6. Let = {0,1,. .. ,k — 1} be an alphabet with k^9. Let m ^ 1 and n^2, and consider the 
partial DFAs A m and B n over given in Figure^} Then every NFA for L(A m ) EB^ L(B n ) has at least 
Iran + 2m + In + 1 states. 

Proof: In plain words, L{A m ) represents all numbers with their base-fc notation using only digits 1, 
2 and k — 1, with the number of Is equal to m — 1 modulo m. Similarly, the base-A; notation of all 
numbers in L(B n ) uses only digits 1, 3 and k — 1, and the total number of Is and (k — l)s should be 
n — 1 modulo n. 




Figure 3: The nondeterministic finite automata A m and B n over = {0, 1, . . . , k — 1} with k^9. 

Let the set of states of A m be P = {0, . . . , m — 1 } and let the states of B n be Q = {0, . . . , n — 1 }. Let 
L = L(A m )S R L(B n ), and let us construct a (2mn + 2m + 2n + 1 ) -state NFA 

M = (Q AB UQ A UQ B U {q acc },Z k ,5, q ,F) 

for the language L as in Lemma Q] The initial state of M is qo = (0,0,0). The full set of transitions 
is omitted due to space constraints; the reader can reconstruct it according to Lemma Q] The below 
incomplete list represents all information about M used later in the proof: 

• Each state (i,j,0) goes to itself by 5; to state (i, j ; + 1,0) by 3; to state (i + 1 , j, 0) by 4, and to state 

+ 1, 1) by k — 2. Each state (m — 1, j,0) also goes to state (B,j,0) by 3. 

• Each state 1) goes to state (i,j,0) by 6. Each state (i,n— 1,1) also goes to state (A, 1) by 0, 
and each state (m — l,j, 1) also goes to state (B,j + 1, 1) by 0. 

• Each state (A,i, 1) goes to itself by 0; to state (A,i,0) by 3; and to state (A,i+ 1,0) by 2. 

• Each state (A, i,0) goes to itself by 2 and k — 1; and to (A, i + 1 , 0) by 1. 
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• Each state (B,j, 1) goes to state (B,j + 1, 1) by 0; to state (B, j,0) by 4; and to state (B,j + 1,0) 
by 2. 

• Each state (B,j,0) goes to itself by 3; and to (B,j + 1,0) by 1 and k— 1. 

• State (A,m — 1,1) goes to state q acc by 1. 

Notice that in states (A,i,c) and (B,j,c), transitions by 5 and by 6 are not defined, and no transitions 
are defined in state q acc . There are four accepting states: (m — n,n — 1,0), (A,m— 1,0), (B,n — 1,0) 
and q acc . Transitions from (i,j,0) and 1) are illustrated in Figure 01 where transitions not used in 
the proof are shown in grey. 



I J+l 




c 


=1 \ \ : 




\ N« 




















c 


=0 



J j+1 




Figure 4: NFA M: transitions out of states (i,j,0) and 

Our goal is to show that every NFA for the language L requires at least 2mn + 2m + 2n + 1 states. 
We prove this by describing a fooling set for the language L of size 2mn + 2m + 2n+ 1. Consider the 
following sets of pairs of strings, in which the difference j — 1 is modulo n (that is, j — 1 = n — 1 for 
i = 0): 

A = {(4 i 3 i ,54 m - 1 - i 3 n - 1 - J 5) | i = 0, 1, . . . ,m - 1, j = 0, 1, . . . ,n - 1}, 

B = {(A i 3 j - 1 (k-2),64 m - l - i 3 n ~ l - j 5) \i = 0, 1, . . . ,m - 1, j = 0, 1, . . . ,n - 1}, 



C={{A l 3 n - 1 {k- 2)0,31 



m—l—i 



22) 



0,l,...,m-l}U 



{(4 i 3 n ~ 2 (A; - 2)03, l m - 1 - i 22) | i = 0, 1, . . . ,m- 1}, 
V = {(4 m - l 3 n ~\k - 2)00 j , n " 1 ^41 rt - 1 33) | j = 0, 1, . . . ,n - 1} U 
{( 4 m-l 3 ™-l(& _ 2 )0 n 41 J ', l"- 1 -^) | j = 0, 1, . .. ,n- 1}. 

Let jF = ^Ui3uCUP. Let us show that the set JF is a fooling set for L, that is, 
(Fl) for each pair (x,y) in JF, the string xy is in L; 

(F2) if (x,y) and (u,u) are two different pairs in J 7 , then xv ^ L or uy ^ L. 
We prove the statement (Fl) by examination of each pair: 
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• If (x,y) is a pair in A, then xy = 4 i 3 3 '54 OT - 1_i 3 n-1 ~''5. The initial state (0,0,0) of M goes to 
state (i,j,0) by 4*3 J , which goes to itself by 5, and then to the accepting state [m— l,n — 1,0) by 
^m-i-jgn-i-jg Thus xy is accepted by M, and so is in L. This case is illustrated in Figured 
left. 

• If (x,y) is a pair in B, then xy = 4 i 3^ 1 (/c - 2)64 m " 1 ^3 n " 1 ^5. State (0,0,0) goes to state 

— 1,0) by 4 i 3 J " _ , which goes to state (z, j, 1) by A; — 2. State (i, j, 1) goes to state (i,j,0) by 6, 
and then to the accepting state [m — l,n— 1,0) by 4 m ~ 1 ~ l 3™ _1_:, 5, which is shown in Figured 
right. 




(U,l) 



Figure 5: A pair in A and a pair in B. 

• If (x,y) is a pair in C, then xy = 4 i 3 n " 2 (/c-2)031 m - 1 - i 22. State (0,0,0) goes to state (i,n- 1, 1) 
by 4 i 3 n_2 (A; - 2), which goes to state (A,i,l) by 0, and then to state (A,i,0) by 3, and to the 
accepting state (A,m — 1,0) by l m ~ l ~ l 22. This computation path is presented in Figure[6l left. 

• If (x,y) isapair in P, then xy = 4 m_1 3 n_1 (fc-2)0 n 41 n ~ 1 33. State (0,0,0) goes to (m- 1,0,1) 
by 4 m - 1 3 n ~ 1 (£; - 2), which goes to state (B, 1, 1) by 0, and then to state (B,0, 1) by n_1 , and to 
state (£?,0,0) by 4, and to the accepting state (B,n — 1,0) by l n_1 33, as shown in Figure |6l right. 

Thus in all four cases, the string xy is accepted by the NFA M, and so is in the language L. This proves 
(Fl). To prove (F2) let us consider the following seven cases: 

• If (x,y) and (u,v) are two different pairs in A, then 

(x,y) = (4 i 3 J ',54 m - 1 - i 3"- 1 ^5) and (u,v) = (4 r 3 s ,54 m - 1 ~ r 3"- 1 - s 5), 

where ^ (r,s). Consider the string xv = 4 l 3- J 54 m-1 ~ r 3 n ~ 1 ~ ,s 5. Since the digit 5 cannot be 
read in any state (B,p,0), after reading xv, the NFA M may only be in state 

(m— 1 — r + i,n — 1 — s+j,0). 

This state is rejecting if i ^ r or j ^ s. So the string xv is not in L. 

• If (x,y) is a pair in A and (u,v) is a pair in B, then x = 4*3 J and v = 6w for a string w. After 
reading x, the NFA M is either in state (i,j,0) or in a state (B,p,0). In these states, transitions 
by 6 are not defined. Thus the string xv is rejected by M, and so is not in L. 
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(B,0,0) (B,1,0) (B,n-1,0) 



Figure 6: A pair in C and a pair in V. 

If (x, y) is a pair in A U B, and (u, v) is a pair in C U V, then y = bw or y = 6w for a string w. Let 
us show that the string uy is not in L. Notice that after reading the string u, the NFA M is either 
in a state (A,p,c) or in a state (B,q,c). In these states, no transitions by 5 and by 6 are defined. 
Therefore, the string uy is not in L. 

If (x,y) and (u,v) are two different pairs in B, then (x,y) = (4 i 3 J '- 1 (fc-2),64 m - 1 - i 3 n_1 - J '5) 
and (u,v) = (4 r 3 s_1 (A;-2),64 m - 1 - r 3 n - 1 - s 5), where ^ (r,s). After reading x, the nfa M 
may only be in state 1); notice that transitions by k — 2 are not defined in states (B,q,0). State 
1) goes to state (i, j,0) by 6. From this state, by reading 4 m ~ 1 ~ r 3 n-1-s 5, the NFA may only 
reach the rejecting state (m — 1 — r + i,n — 1 — s + j,0). Hence the string xv is not in L. 

If (x,y) and are two different pairs in C, then we have three subcases: 

- (x,y) = (4 i 3"- 2 (A;-2)0,31 m ~ 1 " i 22) and 

(u,v) = (A r 3 n - 2 (k -2)0,31 m " 1 - r 22), where 0^i<r^m-l. 

After reading x, the NFA M is in state (A,i, 1), which goes to state (A,i,0) by 3, and then 
to rejecting state (A,m — 1 —r + i, 0) by l m_1_r 22. Thus is not in L. 

- (x,y) = (4 i 3 n - 2 (/c-2)03,l m - 1 ^22) and 

(u,v) = (V3 n - 2 (k - 2)03, l m ~ l ~ r 22), where 0^i<r^m-l. 

After reading x, the NFA is in state (A,i,0), which goes to rejecting state (A,m— 1 — r + i,0) 
by l m - 1 - r 22. Thus xt> is not in L. 

- (x,y) = (4 i 3 n " 2 (A:-2)0,31 m - 1 - i 22) and 

= (4 r 3 n - 2 (fc-2)03,l m - 1 - r 22). 
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After reading u, the NFA may only be in state (A,r,0), where it cannot read symbol 3. Thus 
uy is not in L. 

• If (x,y) is a pair in C, and (u, v) is a pair in V, then y = u>22 for a string w. Consider the string uy. 
After reading it, the NFA may only be in a state from Q B (notice that n ^ 2). By reading w, it 
either hangs, or remains in Q B , and then cannot read 22. Therefore, uy is not in L. 

• If (x,y) and (it,i>) are two different pairs in V, then there are three subcases again: 

- (x,y) = (4 m - 1 3 n - 1 (/c-2)00^0 n - 1 -- ? 41 n - 1 33) and 
(u,v) = (4 m - 1 3 n ~ 1 (A:-2)00 s ,0 n ~ 1 ^ s 41 n - 1 33), where 

0^j<s^n — 1. Since n ^ 2, state (m — 1,0, 1) only goes to state (B, 1,1) by 0. After 
reading x, the NFA is in state (B , j '< + 1 , 1 ) , which goes to rejecting state (B,n — 1 — s + j, 0) 
by n_1 " s 41"" 1 33. Thus iu is not in L. 

- (x,y) = (4 m - 1 3 n - 1 (A;-2)0 n 4P',l n - 1 -J33) and 

(u,v) = (4 m ~ 1 3 n_1 (A;-2)0 n 41 s ,l n " 1 " s 33), where 0^j<s^n-l. After reading x, 
the NFA is in state (B,j,0), which goes to rejecting state (£>,n — 1 — s + j,0) by l n_1 ~ s 33. 
Thus xv is not in L. 

- (x,y) = (4 m - 1 3 n - 1 (A;-2)00^,0 n - 1 ^41 n - 1 33) and 
(u,v) = (4 m - 1 3 n - 1 (A:-2)0 n 41 s ,l n - 1 - s 33). 

After reading x, the NFA M is in state (B,j + 1,1), where it can read neither 1 nor 3. 
Thus xv is not in L. 

We have shown (F2), which means that the set is a fooling set for the language L. Consider one more 
pair (4 m ~ 1 3 n ~ 2 (k — 2)0 1, e). The NFA M may only be in the accepting state q acc after reading the string 
4 m_1 3 n ~ 2 (£; — 2)01. Since in this state no transitions are defined, and the second part of each pair in T 
is nonempty, the set 

^U{(4 m - 1 3 n - 2 (A;-2)01,e)} 

is a fooling set for the language L of size 2mn + 2m + 2n + 1 . This means that every NFA for the 
language L requires at least 2mn + 2m + 2n + 1 states. □ 

The above lower bound is not applicable, in the case of a pair of one-state automata. In fact, in this 
special case the complexity of this operation is lower. While Lemma Q] gives an upper bound of 7 states 
for this case, 6 states are actually sufficient. 

Lemma 7. Let A and B be two l-state NFAs over an alphabet Then the language L(A) S\ R L(B) is 
representable by an NFA with 6 states. 

Proof: Note that these l-state NFAs must be partial DFAs. Following the notation of Lemma [6l let 
denote the state in the NFA A, as well as the state in the NFA B. If NFA A has no transition on k — 1, 
then state (A,0,l) cannot be reached; similarly for NFA B and state (B,0, 1). If both A and B have 
transitions by k — 1, then states (A,0, 1) and (5,0, 1) can be merged into a state goi> which goes by to 
itself, by a symbol a + 1 to state (A, 0,0) if the NFA A has a transition by a, by a symbol 6 + 1 to state 
(5,0, 0) if the NFA B has a transition by b, for all a, b in L k \ {k - 1}. □ 

The next lemma establishes a matching lower bound of 6 states. 

Lemma 8. Let = {0, 1, . . . , k — 1} be an alphabet with k ^ 9, and consider 1 -state partial DFAs A 
and B over which accept languages {2,k — 1}* and {3,k — 1}*, respectively. Then every NFA for 
L(A) E\ R L(B) has at least 6 states. 
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Proof: Let L = L(A) S R L(B). Let the state in the NFA A as well as the state in the NFA B be denoted 
by 0. Consider a six-state NFA for the language L defined in Lemma|7J with the states (0,0,0), (0,0, 1), 
got, (A, 0,0), (5,0,0) and q acc . The transitions of this automaton are shown in Figure[7] Let 




Figure 7: The 1-state NFAs A and B, and the 6-state NFA for L(A) m R L(B). 



A = {(e, 5), (k- 2,6), ((&- 2)0, 32), ((A; -2)03, 2), ((fc- 2)04, 3), ({k -2)01, e)}, 

and let us show that this set is a fooling set for the language L. Since the strings 5, (k — 2)6, (k — 2)032, 
(k — 2)043, and (k — 2)01 are accepted by the NFA, the statement (Fl) holds for A. On the other hand, 
the following strings are not accepted by this NFA: the string 6, any string starting with (A; — 2)0 and 
ending with 5 or with 6, the strings (k- 2)033, (k -2)0432, (A; -2)042, and any string (k-2)0\w with 
w ^ e. This means that the statement (F2) also holds for A. Hence A is a fooling set for the language L, 
and so every NFA for this language needs at least 6 states. □ 

Putting together all the above lemmata, the following result is obtained. 

Theorem 9. For every k^9, the nondeterministic state complexity of positional addition is given by the 
function 

{6, if m = n = 1, 

2mn + 2m + 2n+l, ifm + n^3. 

An obvious question left open in this paper is the state complexity of positional addition with respect 
to deterministic finite automata. A straightforward upper bound is given by 2 2mn+2m+2n+l , though 
calculations show that for small values of k,m,n this bound is not reached. Though the exact values 
of this complexity function might involve too difficult combinatorics, determining its asymptotics is an 
interesting problem, which is proposed for future study. 
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